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Elongated trimeric adhesins are a distinct class of proteins employed by phages and viruses to recognize and bind to 

their host cells, and by bacteria to bind to their target cells and tissues. The tailspikes of E coli phage K1 F and Bacillus 

phage 029 exhibit auto-chaperone activity in their trimeric C-terminal domains. The P22 tailspike is structurally 

homologous to those adhesins. Though there are no disulfide bonds or reactive cysteines in the native P22 tailspikes, a 

set of C-terminal cysteines are very reactive in partially folded intermediates, implying an unusual local conformation in 

the domain. This is likely to be involved in the auto-chaperone function. We examined the unusual reactivity of 

C-terminal tailspike cysteines during folding and assembly as a potential reporter of auto-chaperone function. Reaction v_ ^ 

with lAA blocked productive refolding in vitro, but not off-pathway aggregation. Two-dimensional PAGE revealed that 

the predominant intermediate exhibiting reactive cysteine side chains was a partially folded monomer. Treatment with 

reducing reagent promoted native trimer formation from these species, consistent with transient disulfide bonds in the 

auto-chaperone domain. Limited enzymatic digestion and mass spectrometry of folding and assembly intermediates 

indicated that the C-terminal domain was compact in the protrimer species. These results indicate that the C-terminal 

domain of the P22 tailspike folds itself and associates prior to formation of the protrimer intermediate, and not after, as 

previously proposed. The C-terminal cysteines and triple p-helix domains apparently provide the staging for the correct 

auto-chaperone domain formation, needed for alignment of P22 tailspike native trimer. 



Introduction 

The elongated adhesin proteins that govern the attachment of 
many viruses and phages to cells use their extended lateral surfaces 
to recognize cell surface polysaccharides and lipopolysaccharides. 
Examples include the Adenovirus penton fiber, T4 short tail 
fiber, Bordetella pertussis toxin, the bacteriophage associated 
Hyaluronate lyase (Hylp2), and a variety of phage tailspikes. ^'^ 
The majority of these proteins are multimeric, folding and 
assembling into very stable structures that can survive the diverse 
environments they are exposed to.^ The structures of three 
bacteriophage tailspikes have been determined to high resolution 
(Fig. 1): Salmonella P22 tailspike. Bacillus 029 appendage 
(tailspike), and E. coli KIF tailspike. ^^'^^ These three proteins 
have major [3-helix domains, with triple [3-helix regions, and 
complex trimeric C-terminal domains. Among these three 
proteins, the folding, assembly and off-pathway aggregation 
pathways of the Salmonella P22 tailspike have been characterized 
both in vivo and in vitro. They include monomeric, dimeric and 
protrimer partially folded intermediates. ^^"^^ 



Many phage structural proteins require chaperones to assist in 
folding and assembly. The first identified chaperone, GroEL/ES 
was found associated with phage }i capsid morphogenesis.^^ Other 
early reports of chaperone that aid folding and assembly are 
associated with phage T4 head assembly,^^ T5 tail assembly,^ ^ and 
P22 coat protein folding. ^^'^^ 

The initial efforts to identify a chaperone for P22 tailspike 
folding and assembly were unsuccessful,^^ and the ability of 
the purified fully denatured chains to refold and assemble the 
native biologically active state in vitro argued against such a 
function. ^^-^^'^^ 

Recent reports reveal that KIF and 029 tailspikes utilize an 
intra-molecular chaperone domain (IMC). An emerging class of 
chaperones are the IMC/auto-chaperone segments of proteins that 
are necessary for the efficient folding and assembly of their own 
chains. They were initially identified as N-terminal pro-pep tides 
or pro-sequences needed for maturation of exported proteases 
such as Subtilisin.^^ Chen and Inouye refer to this class as type I.^^ 
Another class, type II, is represented by the C- and N-terminal 
pro-peptides of collagen in which the pro-peptides align and 
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Figure 1. Structures of the C-terminal and auto-chaperone domains in Salmonella phage P22 tailspil<e and homologous proteins. (A) The structure 
of Salmonela phage P22 tailspike (PDB 1TYU). C-terminal domain remains in mature protein. (B) The structure of Bacillus phage 029 gp12 appendage 
(PDB 3GQ7). C-terminal domain is cleaved off after folding completion. (C) The structure of E.coli phage K1F tailspike (endoNF) (PDB 1V0E). C-terminal 
trimeric domain with long tentacles assists folding, and is cleaved off in the mature protein (3GUD, 3GW6). 



tether the strands before the collagen triple helix is formed. These 
registration sequences are subsequently cleaved off^^ Recently, 
non-cleaved type II IMC-like domains have been reported in 
[3-helical trimeric auto transporter proteins. The auto transporter 
Pertactin protein of Bordetella, folds the C-terminus domain in 
the outer membrane, creating a channel for the rest of the chains 
to move through, and serving as the template for the folding of 
the N-terminal "passenger" domain.^^'^^ 

Recent studies of the assembly of tailspikes of the Bacillus 
subtilis phage 029 and E. coli phage KIF have shown the 
processes to be governed by their C-terminal domains. ^^'^^ The 
crystal structures of these adhesins are close analogs to the tailspike 
of the Salmonella bacteriophage P22 (Fig. 1). 

The phage P22 tailspike is predominantly [3-sheet (Fig. 2).^^'^^ 
The N-terminal domain (residues 1—124) is comprised of anti- 
parallel [3-sheets forming a trimeric mushroom or dome-like 
structure that is required for binding to the phage capsid. The 
major structural domain (residues 143-539) is comprised of 13 
rungs of parallel (3-sheets in a [3-helical conformation (Fig. 2A 
and C). The length of the domain is presumably for the binding 
and cleavage of the lipopolysaccharide on the host cell surface. ^^'^^ 
The three (3-helical domains are bound through predominantly 
hydrophilic interfaces. 

After the 13th rung the parallel [3-helix motif terminates and 
the three chains twist around each other and intertwine to form 



an interdigitated triple [3-helix domain (residues 540-546) 
(Fig. 2C). The chains (residues 547-612) then separate to form 
3 sides of a triangular [3-prism, essentially an oligomeric left 
handed [3-helix. The subunit interfaces constitute a single buried 
hydrophobic core (Fig. 2C). The chains then twist by a loop-short 
a-helix-loop that put three pairs of cysteines — C613 and C635 — 
in a distinctive even plane, forming a ring-like conformation, 
termed the cysteine annulus (Fig. 2B and C). The tailspike ends 
in a triple bladed motif termed the caudal fin. 

The structures of the tailspikes of Bacillus phage 029, and 
E. coli phage KIF share similarities with the P22 tailspike (Fig. 1). 
The large structural domain region contains the elongated binding 
and catalytic domain, [3-helical for P22 and 029 (Fig. lA and B), 
and for KIF, three [3-propellers and a [3-barrel (Fig. IC). Further 
along is the shaft region with different [3-sheet structures: a triple 
[3-helix in the P22 tailspike (Fig. lA); a triple [3-prism for 029 
(Fig. IB), and have similar elements to the a triangular [3-prism/ 
triple [3-helix of KIF (Fig. IC).^"'^^ 

These three tailspikes have a trimeric bladed configuration in 
their C-terminal domains (Fig. 1, bottom panels). The tailspikes 
of 029 and KIF have "tentacles" that extended up to the triple 
[3-helix shaft. P22 tailspike lacks these tentacles. 

When the 029 appendage and KIF tailspike are expressed 
without the C-termini, the resulting tailspike chains aggre- 
gated. ^^'^^ 029 and KIF mutants that retain the C-termini 
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Figure 2. Structure of P22 tailspike with cysteine locations, and schematic of the folding and assembly pathway. (A) P22 tailspike single chain crystal 
structure with the N-terminal head domain deleted. Yellow spheres indicate cysteine residues. (B) A cross-section of the native trimer cysteine annulus 
bottom to top view. The three individual subunits are colored red, blue and green. Two cysteines from each subunit (C613 and C635) are located 
in this region. (C) The N-terminal head binding domain deleted native trimer showing the eight cysteines (PDB 1TSP). The P22 tailspike major structural 
domains are parallel p-helical domain (109-539), interdigitated triple p-helix domain (540-555), triangular p-prism and triple blade motif structure 
(556-666). All eight cysteines are reduced in the naive structure.^^ (D) Schematic diagram of intermediates in the in vitro refolding, assembly, and 
aggregation of tailspike chain. U: unfolded polypeptides. Nj: native trimer. The folding pathway depicts intermediates and disulfide bonds distributions: 
The state of the reactive cysteines in the [M] and [M*] species are described in the Results section. Dimeric intermediate: [D] and protrimer intermediate: 
[Pj] have been reported to contain inter-disulfide bonds: [-S-S-]. All disulfide bonds are reduced: [-SH] in native trimer. The aggregation prone 
intermediates are marked as [*]. No disulfide bonds have been reported in aggregation species. 



domain trimerize successfully. These results show that the 
C-termini domains are important for folding and assembly. 

The C-termini of the 029 and KIF chains acting as IMC/ 
autochaperones raise the question of the role of the C-terminal 
trimeric domain in the folding and assembly of the P22 tailspike. 
Unlike 029 and KIF, the P22 C-terminal domain is not cleaved 
after native trimer folding. However, the isolated C-terminal 
domain (residues 537-666) of P22 tailspike can function as 
an independent oligomerization domain. When three maltose 
binding protein were tethered to a trimeric P22 tailspike 
C-terminal domain, the chimera chains assembled into a P22 
tailspike-like trimer. This suggests a type II IMC function is 
exhibited by the C-terminal domain of P22 tailspike. 

In earlier searches for cellular chaperones for P22 tailspike 
folding and assembly, infected cells were treated with iodoaceta- 
mide (lAM) with the intent to block ATP synthesis, in the hope 
of trapping chaperone complexes. lAM did indeed result in 
blocking intracellular tailspike folding and assembly. However, it 
turned out to be due to direct reaction of the lAM with reactive 
C-terminal cysteines in tailspike folding intermediates.^^ These 
cysteine thiols are not reactive in the native state, and in fact are 
strongly hydrogen bonded.^^ 



The in vivo and in vitro refolding pathway of the P22 tailspike 
proceeds through a number of folding intermediates (Fig. 2D). 
The polypeptide chains emerge from the ribosome or out of the 
chaotropic agent and proceed to form a single chain partially 
folded intermediate, in which the (3-helical domain is somewhat 
structured. The intermediate further folds to form a species 
competent for chain/chain association yielding the metastable 
protrimer. The protrimer then proceeds to the mature thermo- 
stable native tailspike trimer. ^^'^^'^^'^'''^^ 

The partially folded monomeric intermediates are very sensitive 
to temperature, and shift from productive folding to aggregation 
a few degrees above physiological temperature. ^^'^^'^^ As a result of 
this thermolability of the early folding intermediate, the chain is 
the locus of many temperature sensitive folding (tsfj mutants. "^^'"^^ 

The metastable protrimer is trapped in the cold, and can be 
distinguished from the native state by its retarded migration in 
native-PAGE. The protrimer has not acquired the detergent, 
thermal or protease resistance, found in the native state. ^^'^^ The 
protrimer can be converted into the native state in the absence of 
exogenous proteins or factors. 

Robinson and King reported that the protrimer intermediate 
formed in vitro contains transient inter-chain disulfide bonds. 



38 



Bacteriophage 



Volume 2 Issue 1 



which must be reduced during maturation to the native state.^^ 
The addition of reducing reagents to P22 tailspike folding 
intermediates retarded mobility of monomeric intermediate 
through native-PAGE, also suggesting the presence of intra-chain 
disulfide bonds, presumably formed from the reactive cysteines. '^'^ 

The unusual reactivity of the C-terminal cysteine thiols may be 
due to interactions with side chains or backbone atoms within 
some distinctive local conformation, as seen in the reactive site of 
cysteine proteases. The thiol group of cysteine is deprotonated by 
nearby basic residues such as histidine before starting cleavage."^^'"^^ 
Similarly the reduction of disulfides, formed during folding or 
assembly, may be performed by local side chain, or backbone 
interaction within folding intermediates. Disulfide exchange 
between intra- and inter-disulfide bonds among P22 tailspike 
folding intermediates support this hypothesis. 

The presence of disulfide bonds in intermediates, yet missing 
from the native state has been carefully studied for bovine 
pancreatic trypsin inhibitor. ^^'^^ Similarly, the registration peptide 
of collagen utilizes transient intra-disulfide bonds for nucleation 
of C-terminal domain to stabilize the chain association stage. ^^'^^ 
The collagen zipper-like trimer folding proceeds toward the 
N-terminal domain with using inter-disulfide bonds formation. 
At the end of trimer folding, both N-terminal and C-terminal 
domains are processed by procollagen peptidase. 

The recent identification of the auto-chaperone function in 
phage 029 and KIF adhesins suggests that the P22 tailspike 
cysteine annulus may be acting to organize and get in register the 
tailspike chains for protrimer assembly. In that case one might 
expect mutants of these cysteine residues to retard folding, 
without absolutely blocking the overall reaction. Haase-Pettingell 
et al. showed that the cysteine to serine mutants in vivo retarded 
the rate of formation of native tailspike. The single serine 
mutations for each C-terminal cysteine decreased folding yields 
and kinetics, but did not completely inhibit trimer folding. 
However, double mutations of both cysteines for serine residues 
C613 and C635 sharply lowered the yield of native trimer."^^'^^ 

The homologous tailspike from Det7 of Salmonella enterica 
lacks cysteines in this region, Seckler and coworkers have sug- 
gested that the transient disulfide bonds are not physiological 
but an artifact of the relatively slow in vitro refolding process. If 
they do not form disulfide bonds, the in vivo reactivity to lAA 
(iodoacetic acid) nonetheless indicates an unusual state of cysteine 
residues during folding and subunit assembly. This conformation 
dependent reactivity is likely to be a reporter or surrogate for the 
conformation of the C-terminal portion of the chain needed to 
carry out its auto-chaperone function. 

As noted above, the unusual reactivity of at least two of the 
eight tailspike cysteines during folding, presumably represents 
activation by local interactions in some of the partially folded 
species. If the cysteine annulus motif represents an auto- 
chaperone function of this domain, it should form early in 
folding rather than later, as we previously proposed. ^^'^^ The 
reactivity of those cysteines is likely to serve as a reporter for 
folding of this region of the chain. Here we examine more 
carefully the reactivity of the cysteine residues during refolding 
in vitro, following conformational intermediates separated by 



native-PAGE, and using mass spectrometry to identify folding 
and assembly intermediates with reactive cysteines thiols. The 
results suggest that the conformation maintaining the cysteine 
thiols in a reactive state forms very early in folding, prior to chain 
association, and that our earlier model needs to be revised. 

Results 

Which intermediates in the folding and assembly pathway 
exhibit unusual cysteine reactivity? Previous work has shown that 
in vitro refolding of P22 tailspike proceeds through a series of 
folding and aggregation intermediates. Spectroscopic techniques 
have not resolved the monomeric and oligomeric productive 
intermediates or the multimeric off-pathway species. Though 
native-PAGE is not a traditional method for separating con- 
formational intermediates, the 666 residues of the tailspike chain, 
and the complexity of the folding and aggregation pathway have 
made it preferable to other biophysical techniques (Fig. 3).*^°'^^ 
Utilizing native-PAGE, Betts and King resolved intermediates 
species on the productive pathway — two partially folded mono- 
meric species Mslow MfasTj partially folded dimeric inter- 
mediates, protrimer intermediate, and of course the native trimer 
(Fig. 3A).^^ At higher temperatures, the refolding chains partition 
to multimeric aggregation intermediates, which have slightly 
different electrophoretic mobility from the related productive 
intermediates (Fig. 3B). At intermediate temperatures both pro- 
ductive and off-pathway intermediates are populated (Fig. 3C). At 
0-4°C (Fig. 3A), two partially folded monomeric species Mslow 
and MpAST are precursors to both productive pathway inter- 
mediates and non-productive off-pathway aggregation intermedi- 
ates.^^ The complexity of the gel patterns is not an artifact of the 
method, but captures the population of multiple intermediates 
required for the chains to reach both the native trimer state, or 
aggregated inclusion body state.^^ '^^'^^ 

All eight cysteines are reduced and completely unreactive in 
the native tailspike. ^^'"^^ The inhibition of tailspike folding by the 
in vivo labeling of C-terminal cysteines with radioactive iodo- 
acetamide (lAM), as well as the characterization of intra-disulfide 
bond in vitro refolding intermediates, did not unambiguously 
identify which intermediates exhibited redox-active cysteine 
thiols. ^^'^^ To identify the partially folded in vitro intermediates 
with the reactive cysteines thiols, we utilized alkylation with 
lAA. P22 tailspike at 2.0 mg/mL was unfolded in acid urea pH 3, 
and refolding was initiated by dilution to 0.1 mg/mL into ice cold 
refolding buffer, 100 mM Tris, 2 mM EDTA, pH 7.6 for 15 min 
(residual concentration of urea 0.8 M).'^^ Then the temperature 
was shifted to 24°C for further refolding and samples taken at 
various times. For the initial experiments, tailspike was refolded 
with or without lAA. 

As noted above, the complexity of the band pattern represents 
the presence of both productive and off-pathway intermediates 
within the samples (Fig. 4A). The kinetic species resolved in this 
gel were monomeric intermediates, dimeric intermediates, native 
trimer, protrimer intermediate, as well as trimeric, and multimeric 
aggregates. In the left lanes at early times, the partially folded 
monomeric and dimeric intermediates predominate. Note that 



www.landesbioscience.com 



Bacteriophage 



39 



A B 



C D 



Protrimer 
Native trimer 
Dimeric 



Ms 




Aggregates* 
<- Trimeric* 

^Dimeric* 



4°C 37°C 



^1^1 Aggregates* 

^^Protrimer 

^5 .^r^Trimeric* 



r Dimeric 




Native trimer 
Dimeric 
Dimeric* 



20X 



Figures. During the refolding of P22 tailspike, 
productive folding intermediates and non-productive 
nnultimeric aggregation, can be resolved with 
Native-PAGE. (A) At 4°C productive intermediates 
predominate as visualized by western blot analysis of 
a Native-PAGE, probed by a mixture of monoclonal 
antibodies recognizing non-native epitopes. (B) At 37°C 
non-productive folding intermediates and multimeric 
aggregation are detected by the same western blot 
analysis in (A). (C) At 20°C both productive and 
non-productive species are observed as seen in this 
silver stain of a Native-PAGE. (D) The graphic depicts the 
bands, the productive folding intermediates and the 
non-productive species from the gel in (C). Black labeled 
bands are productive species, gray and italic labeled 
bands are non-productive species. (A and B) were 
adapted from Betts and King 1998.^' 



due to the time taken to load the native gels, and initiate 
electrophoresis, the "0" time sample has incubated for additional 
minutes, accounting for the presence of multiple intermediate 
species. After 60 min of refolding at 24°C (right-hand lane), 
significant amounts of native trimer has accumulated. In native 
gels the productive protrimer forms the trailing edge of the off 
pathway trimeric aggregation intermediate band and is not fully 
resolved as seen in Figure 3C. 

In Figure 4B, lAA has been added to the folding reaction 
at various times of incubation — 0, 5, 30 and 60 min — 
corresponding to the samples in Figure 4A. These samples were 
further incubated with lAA as indicated in the top of Figure 4B 
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Figure 4. In vitro refolding of P22 tailspike with or without iodoacetic 
acid (lAA). The monomeric species were labeled and stalled by alkylation 
treatment. P22 tailspike was denatured and refolded as described 
in Materials and Methods. At indicated times, lAA was added to stall 
folding reaction and analyzed by non-reducing 1-D native-PAGE. 
(A) In vitro refolding at 24°C shows the native trimer formation as 
a function of the refolding time. (B) Samples were made up to 30 mM 
lAA at indicated times and incubated for total 60 min at 24°C. lAA 
reacted monomeric species that aggregated, remained at the top of gel. 



[Refolding time (min) + lAA labeling time (min) to equal a total 
of 60 min] . Reaction of lAA with refolding tailspike and further 
incubation at 24°C sharply reduced the formation of native 
tailspike (Fig. 4B, 0 time). Though failing to yield native ^ 
tailspikes, the samples exposed to lAA at early times accumulated ^ 
higher molecular weight oligomeric aggregation species, visible at 
the top of the gel. In addition, the band corresponding to partially 
folded monomeric intermediate was not depleted upon incuba- 
tion and appeared to accumulate. The simplest interpretation is 
(1) alkylated chains are unable to proceed through the productive 
pathway to the native state as found in vivo;^^ (2) polymerization 
to high molecular weight off-pathway aggregates is not inhibited; 
and may be promoted; in the presence of lAA. 

P22 tailspike folding intermediates which have reacted with 
lAA should have an additional net charge due to the negatively 
charged carboxymethylated group. These modified chains can 
be expected to be fractionated from the unreacted chains. 
Two-dimensional (2-D) native-PAGE was employed in this 
fractionation. 

During 4°C incubation, partially folded monomers, dimers, 
and protrimer intermediates accumulated due to suppression 
of aggregation at low temperature.^^ To selectively observe lAA 
labeled intermediates, labeling and subsequently non-reducing 
2-D native-PAGE were performed under the 4°C condition 
(Fig. 5). The gel slice on the top of the gel is the lane of the first- 
dimensional (1-D) native-PAGE separation placed horizontally 
to show the species present in the refolding reaction after 3 h at 
24°C in the absence of lAA. For the samples of Figure 4B, 
30 mM lAA was added after 2 h refolding, and incubated for 
an additional 1 h at 4°C. Comparison of the 1-D pattern reveals 
that the formation of the native trimer was clearly inhibited, and 
monomeric species have accumulated (Fig. 5B, top gels). In 
Figure 5 after separation through in the 1-D native-PAGE, the 
gel lanes were excised and put on a non-reducing 2-D native gel 
(in this case without further treatment), and electrophoresed in a 
second dimension producing the observed diagonal pattern. 

In Figure 5A, the most intense spot in the middle of the 
diagonal line corresponds to native trimer (Nt). The species with 
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Figures. lAA treatment altered monomeric tailspike intermediates. 
(A) Non-reducing 2-D native-PAGE of refolded P22 tailspike without lAA 
labeling. P22 tailspike was denatured, refolded for 2 h at 24°C, and then 
additionally incubated for 1 h at 24°C without lAA. (B) Non-reducing 2-D 
native-PAGE of refolded P22 tailspike with lAA labeling. P22 tailspike was 
denatured, refolded for 2 h at 4°C, and then additionally incubated 
for 1 h at 4°C with 30 mM lAA. Nj, Pj, and D indicate native trimer, 
protrimer, and dimeric intermediates. Mslow and Mfast refer to electro- 
phoretically slower and faster migrating monomeric intermediates. 



retarded migration in the upper left of the gel represent trimeric, 
and higher order aggregates. On the lower right on the diagonal 
line are two monomeric intermediates, slower migrating mono- 
meric Mslow species and faster migrating monomeric Mfast 
species. 

In Figure 5B, with the applied sample lane on top, refolding 
samples treated with lAA were electrophoresed in the first 
dimension. The inhibition of native tailspike formation and 
accumulation of slower migrating monomeric Mslow intermedi- 
ate was clear, in comparison with untreated control (Fig. 5A, top). 
The fractionated bands were then electrophoresed through the 
2-D native-PAGE. While most bands were on the diagonal, 
additional spots migrated below the diagonal line indicating 
increased charge, or compactness (Fig. 5B). One of the intense 
spot migrated with the same mobility as faster migrating mono- 
meric MpAST species in 1-D native-PAGE. The faster migrating 
species has the negatively charged carboxymethylated group 
introduced by the lAA, which may account for the change of 
migration, or the reaction with the thiol may have altered the 
conformation and thus the migration. 

Regardless of the precise origin of the altered mobility, these 
results confirmed that cysteine side chains with unusual chemical 
reactivities occur in a monomeric folding intermediate in the 
pathway. These results are similar to previously reported by 
Danek and Robinson for tailspike monomeric complex treated 
with oxidized glutathione.^^ Given that the cysteine reactivity 
must depend on some distinct chain conformation, this sensitive 
precursor may represent species with the C-terminal chaperone 
domain folded into a competent conformation. 

Distinguishing monomeric precursors in the productive and 
aggregation pathways. Systematic study of temperature sensitive 



folding {tsfi mutants of the P22 tailspike established that the 
folding defect destabilizes a thermolabile partially folded inter- 
mediate leading to off-pathway aggregation.^^'^^'*^*^ Since chaper- 
ones typically function to suppress off-pathway aggregation, we 
wanted to examine the relationship of the two classes of partially 
folded monomeric intermediates to the productive and aggrega- 
tion pathways. 

In order to investigate productive and non-productive mono- 
meric intermediates, non-reducing 1-D native-PAGE and non- 
reducing 2-D native-PAGE were employed. P22 tailspike was 
denatured, refolded at lower temperature without additional lAA 
to observe free cysteine side chain interactions, and applied to 
non-reducing 1-D native-PAGE. To follow the effects that tem- 
perature had on the monomeric intermediates, the 1-D native 
gel slice was incubated in vitro at 24°C restrictive temperature 
for 0 h, 1 h and 8 h. All gels were stored on wet ice to halt 
folding until applied to the non-reducing 2-D native-PAGE 
(Fig. 6). For clarity a graphic depiction of the 1-D gel is 
positioned on the 2-D gel. 

Both slower migrating monomeric Mslow faster migrating 
monomeric Mfast intermediates were observed on a diagonal in 
the non-reducing 2-D native gel (Fig. 6A: "8 hours on ice"). The 
arrow points to an additional faster migrating monomeric Mfast 
intermediate below the Mslow (Fig- 6A, arrow). This shifted 
monomeric species below the diagonal line may be produced by 
intra-disulfide bond formation, or folding compaction of slower 
migrating monomeric Mslow intermediates. After 1 h incubation 
at 24°C sample, the slower migrating monomeric Mslow band 
decreased while the intensity of faster migrating monomeric 
Mfast increased (Fig. 6B, "1 hour at 24°C," arrow). After 8 h 
incubation at 24°C, no Mslow or Mfast monomeric inter- 
mediates were visible (Fig. 6C, "8 hours at 24°C"). We concluded 
that all monomeric species entered the aggregation pathway and 
were unable to enter the 2-D native gel. It appears that the slower 
migrating Mslow intermediate is either intra-disulfide bonded, 
or collapsed to form faster migrating Mfast species, then pro- 
ceeded down the aggregation pathway. There was little evidence 
for thiol reactivity among the higher molecular weight multimeric 
intermediates, confirming previous observations. ^^'^^ 

Which reactive cysteine-containing intermediates partition to 
the folding pathway? The prior experiments indicated that all 
partially folded monomeric intermediates upon extended incuba- 
tion eventually shifted onto the aggregation pathway. However, 
it is evident that under the refolding conditions one of the 
monomeric species is the kinetic productive intermediate and 
the other is the off-pathway aggregation precursor. In the auto- 
chaperone model, the chain that had reached the required 
C-terminal conformation would be productive. Because the state 
of the reactive cysteines can be a report on this conformation, we 
would expect different reactivities between the productive and 
off-pathway intermediates. To this end, we employed non- 
reducing and reducing 2-D native-PAGE to investigate the state 
of the reactive thiol groups exposed in folding intermediates. 

P22 tailspike was denatured, refolded, then applied to non- 
reducing 1-D native-PAGE. The excised lanes were incubated 
with or without 0.1 M DTT for 1 h at 24°C before loading onto 
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Figure 6. P22 tailspike monomeric intermediates were heat labile and aggregated in native gel. (A) Refolding tailspike was electrophoresed in the first 
dimension, then excised gel lane was incubated in 1 X native-PAGE running buffer on wet ice for 8 h before non-reducing 2-D native-PAGE. 
(B) The excised 1-D gel lane was incubated at 24°C for 1 h, then stored on wet ice until loaded on non-reducing 2-D native-PAGE. Mslow was decreased 
with increased Mfast/ indicated intra-disulfide bond formation, or folding collapse. (C) The 1-D excised gel lane was incubated at 24°C for 8 h before non- 
reducing 2-D native-PAGE. All monomeric species aggregated to complexes too large to enter gel. 



2-D native-PAGE (Fig. 7). As seen in Figure 7A, during the 2-D 
native-PAGE in the absence of DTT, most of the intermediates 
remained on the diagonal line with the small amount of smeared 
Mslow unlike the Mslow shift in Figure 6B. This difference 
may be caused by the 7 h incubation of 1 -D native gel on wet ice 



before 2-D native electrophoresis (Fig. 6B). Even at very low 
temperature in the native gel, monomeric intermediates may 
alter their conformations. 

In the right Figure 7B, in which the second dimension is 
run on native-PAGE after DTT treatment, a number of new 
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Figure?. The effects of DTT reduction on the various tailspike folding and assembly intermediates as monitorized by 2-D native-gel electrophoresis. 
(A) The excised gel lane from 1-D native-PAGE was incubated in native running buffer in the absence of DTT at 24°C for 1 h before 2-D native-PAGE. 
The productive folding and assembly intermediates migrated on the diagonal in the second dimension. (B) The excised gel lane from 1-D native-PAGE 
was incubated in native running buffer in the presence of 0.1 M DTT at 24°C for 1 h before 2-D native-PAGE. Native trimer formed from slower migrating 
monomeric Mslow intermediates during incubation with DTT can be seen (larger circle). Reducing induced native trimer formation from protrimer Pj 
intermediate (smaller circle). Aggregation intermediates were formed from faster migrating monomeric Mfast intermediate (triangles at the right of gel). 
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off-diagonal spots were observed (Fig. 7B). One spot appeared at 
the native trimer position (Fig. 7B, large open circle) migrating 
above Mslow species. This confirmed that the Mslow species 
was productive and proceeds to the native state in the presence 
of DTT. It also should be noted that the formation of a small 
amount of native trimer is seen below the protrimer position, as 
previously reported (Fig. 7B, small open circle). Further analy- 
sis of the samples by 2-D SDS-PAGE showed that native like 
bands were not dissociated into monomers in the presence of 
SDS, confirming the existence of native tailspike trimer (data 
not shown). This supported the interpretation that the Mslow 
intermediate represents the chain conformation needed for pro- 
ductive folding and assembly. The conversion of Mslow to native 
trimer was limited, with significant levels of Mslow species still 
remaining after 1 h DTT treatment at 24°C (Fig. 7B). 

The presence of DTT sharply reduced the level of the MpAST 
species but it was not converted to native trimer. Instead, 
aggregated multimeric species appear above the MpAST species 
(Fig. 7B, triangles on side of gel), which suggests that the MpAST 
is a disulfide bonds containing species kinetically trapped in a 
non-productive conformation. Upon reduction it proceeds down 
the aggregation pathway. 

In the position below the dimeric intermediates, Mslow 
monomeric species appeared, presumably due to dissociation of 
some dimeric intermediates rather than the productive protrimer. 
Since the productive and off-pathway dimeric intermediates were 
not resolved in the first dimension, it was difficult to interpret in 
the second dimension. 

From these results, we concluded that tailspike folding 
under reducing condition is due to the presence of a partially 
folded monomeric intermediate with critical thiols either correctly 
disulfide bonded or in some critical environment in the pro- 
ductive conformation. This thiol environment is presumably not 
maintained in the species that aggregates. 

When does the C-terminal domain become compact in the 
pathway? In the native state the cysteine annulus and triple 
[3-helix are intimately packed and symmetrically aligned (Fig. 2). 
It seemed unlikely that the early partially folded precursors 
would have this packing prior to their association and assembly. 
If the C-terminal domain is acting as an auto-chaperone, one 
may expect the protrimer be arranged in a conformation closer 
to the native tailspike. To pursue this question we compared the 
resistance of the partially folded tailspike monomeric species and 
protrimer to trypsin digestion, using mass spectroscopy to analyze 
the proteolytic fragments. 

Burying of cysteine residues in the folding of the C-terminal 
domain would be coupled with the formation of tertiary struc- 
ture. To compare cysteine reactivity in monomeric and protrimer 
folding intermediates, a two stage thiol labeling was employed, 
with initial exposure to lAA followed by reduction and 
subsequent labeling with lAM. Free thiol groups before in-gel 
trypsin digestion would be carboxymethylated with initial lAA 
treatment. Thiol groups involved in disulfide bonds and/or are 
buried in tertiary structure would not react with lAA. The species 
would be partially denatured by the organic solvent, such as 
acetonitrile for dehydration of gel, these disulfides would be 



reduced with DTT, and the resulting thiols and buried thiol 
would react with lAM during a part of in-gel digestion. 

The mass spectrometry of such labeled tryptic fragments 
identifies reactive thiols due to difference of molecular mass 
between carbamidomethylated and carboxymethylated cysteines 
in the same peptide. We will first describe the protection of the 
tailspike chain from tryptic digestion during folding and assembly, 
and then the thiol labeling results. 

Trypsin fragments of the treated samples were generated and 
identified by mass spectrometry. The native tailspike trimer is 
stable to enzymatic digestion, while its folding and assembly 
intermediates are at least partially susceptible. The summary of 
the recovery of the peptides is depicted (Fig. 8). 

As shown in Figure 4 partially folded monomeric tailspike 
wild type (WT) chains accumulated after reaction with lAA. 
For the tryptic analysis, lAA was added at 30 min of refolding 
tailspike and incubated for 2 h at 4°C. After non-reducing 1-D 
native-PAGE, the accumulated predominant monomeric band 
was excised and used for in-gel trypsin digestion. 80% of total 
peptide of WT monomeric intermediate was recovered. 

To compare with protrimer intermediate, the band corres- 
ponding to WT protrimer from the refolding reaction was 
excised from non-reducing 1-D native-PAGE. This gel band was 
incubated with lAA for 1 h to label reactive thiols in the pro- 
trimer intermediate. The gel band was applied for in-gel trypsin 
digestion in order to generate peptides from the protrimer 
intermediate. In this case only 48% of total peptide of WT 
tailspike protrimer intermediate was recovered. 

The difference in peptide recovery between monomeric and 
WT protrimer intermediates is shown using the native trimer 
structure to depict the recovered peptides (Fig. 8A, "P22 tailspike 
monomeric intermediate vs native protrimer"). Yellow peptides 
were identified in both monomeric and protrimer intermediate. 
Red peptides were recovered only from the monomeric inter- 
mediate, and thus considered protected in the protrimer. White 
peptides were not identified in either species. In comparing 
peptide recovery from protrimer with partially folded monomeric 
species, the C-terminal domain of protrimer intermediate appears 
to be protected from trypsin digestion (Fig. 8A, red color region). 

The protrimer in the in vitro refolding of wild type tailspike is 
a transient species and does not accumulate. To increase the 
concentration of this species, we used a mutant in the triple 
[3-helix domain 540-548, which accumulated protrimer inter- 
mediate.^^ In this truncated tailspike variant, AN N547Y, the 
N-terminal 108 residues are deleted. The intense protrimer band 
of AN N547Y was excised after non-reducing 1-D native-PAGE, 
then analyzed similarly to WT protrimer. 

Forty four percent of total peptide of AN N547Y protrimer 
intermediate was recovered (Fig. 8B, "P22 tailspike monomeric 
intermediate vs AN N547Y Protrimer"). There were some 
differences in peptide recovery between WT protrimer inter- 
mediate and AN N547Y protrimer. The N-terminal [3-helix near 
the head binding domain was protected in AN N547Y protrimer 
(Fig. SB, red color region). The protection of the C-terminus 
begins at the cysteine annulus in the WT protrimer, and in the 
AN N547Y extends into the triangular [3-prism region. 
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Figures. Comparison of tryptic peptide recovery between P22 tailspike nnononneric internnediates and protrinner internnediates. Yellow region represents 
peptide identified in both species. Red region represents only identified in monomer species. White region peptide could not be identified in either. 
(A) The difference in trypsin digested peptides between WT monomeric intermediate and WT protrimer. (B) The difference in trypsin digested peptide 
between WT monomeric intermediate and AN N547Y protrimer. C-terminal domain was protected in both protrimer intermediates. 



The WT protrimer intermediate and protrimer mutants 
may have some conformational differences, but both protrimers 
have stable C-terminal regions. Our interpretation is that the 
C-terminal domains of partially folded monomeric inter- 
mediates are relatively free and flexible and thus sensitive to the 
protease. Then this region begins to assemble, and collapse to 
a protease insensitive compact structure in the protrimer and 
native trimer. 

Which cysteines alter their reactivity in protrimer assembly 
step? Kim and Robinson proposed that the disulfide exchange 
of transient tailspike disulfide bonds was a key step in the 
formation of the protrimer.^^ In the auto-chaperone model these 
steps would be associated with that function, similar to the 
disulfide exchange in procollagen folding and assembly. The 
earlier studies identifying transient disulfide bonds in the C- 
terminal cysteines were unable to resolve whether they were inter- 
subunit or intra-polypeptide.^'^ 

To identify the cysteine residues participating in putative 
transient disulfides, double alkylation labeled peptides from 
monomeric species and AN N547Y protrimer in prior experi- 
ment were compared (Table 1). From mass spectrometry of the 
lAA treated monomeric intermediate, three carboxymethylated 
cysteines C496, C613 and C635 were identified, these were in 
peptides in 484-497, 571-615 and 632-648 (columns 1 and 4 
of Table 1). We used negatively charged modification to label 
exposed cysteines before the tryptic digestion. The negatively 
charged (lAA labeled) cysteine side chains may partially block 
trypsin access to the positively charged cleavage sites (K607 and 
K611). As a result, lAA labeled cysteine containing peptide were 
probably not completely digested into small peptides, and we 



could identify as a long peptide. We could not find (K^o?) 
VNHK^iiDCR peptide containing one uncleaved site. Thus, lAA 
labeled cysteine inducing local structural alteration would involve 
both K607 and K611. The two peptides containing cysteine 613 
and cysteine 635 are located in the C-terminal cysteine annulus, 
and peptide 484-497 containing cysteine 496 is at the end at 
[3-helix (column 2 of Table 1). Sather and King confirmed 
reactivity of monomeric intermediate cysteines 496, 613 and 635 
to lAM in vivo.^^ 

Two lAM labeled cysteines C169 in 164-170 and C267 in 
266-274 at (3-helical domain were recovered (column 1 and 4 
of Table 1). CI 69 and C267 are among the stacked side chains 
that appear to fold early and rapidly. ^^'^^ The labeling of these two 
cysteines by lAM indicates they were disulfide bonded in the 
monomeric intermediate, or were already buried and unreactive 
in early folding intermediates. 

Two predicted peptides, 286-291 from [3-helix domain con- 
taining two cysteines C287 and C290, and 447-483 from near 
the end of N-terminal [3-helix domain containing C458 were not 
detected (column 4 of Table 1). Subsequently, endoproteinase 
Asp-N or Glu-C was used for the same experiment to generate 
appropriate length cysteine containing peptides. However, neither 
peptide was recovered (data not shown). Since the [3-helix forms 
rapidly even in the partially folded monomeric intermediate,^^ the 
cleavage sites flanking these peptides may be inaccessible to 
proteolysis. 

These results indicated that three cysteines in partially folded 
monomeric intermediates; C496 at the end of [3-helix region, 
C613 and C635 in the C-terminal domain are free and reactive, 
while other cysteines are not reactive. 
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Table 1. Modification of cysteines in WT monomeric intermediate and AN N547Y protrimer mutants measured by double alkylation with mass spectrometry 



Cysteine containing 
peptide 


Structure 


Tryptic peptide 


Modification in 
WT monomeric 


Modification in AN 
N547Y protrimer 


164-170 


p-helix 


VLTIECifioK 


lAM 


X 


266-274 


P-helix 


EC267IGVEVHR 


lAM 


lAM 


286-291 


P-helix 


GC287H FC290K 


X 


X 


1 447-483 


p-helix 


GMYVSNITVEDC458AGSGAYLLTHESVFTNIAIIDTNTK 


X 


X 


484-497 


p-helix 


DFQANQIYISGAC496R 


lAA 


lAA-HlAM 


571-615 


C-terminal annulus 


TLDSGALYSHINGGAGSGSAYTQLTAISGSTPDAVSLKVNHKDC613R 


IAA+ 


X 


632-648 


C-terminal annulus 


DSSC635FLPYWENNSTSLK 


lAA 


X 



^The modification of C613 was identified only in monomeric intermediates 571-615 peptide containing uncleaved sites. 



We proceeded to assess the state of cysteines in the AN N547Y 
protrimer mutant. For this species only two of seven cysteine 
containing peptides were recovered, one containing cysteines 
C267 and C496, (column 5 of Table 1). The three cysteines 
(CI 69, C6I3 and C635) containing peptides recovered from 
monomeric intermediate were not recovered. This presumably 
reflects the protective folded and partial assembled state of the 
chains in the protrimer state. The peptide containing C496, 
resulted in a mixture with some chains labeled by lAA, and some 
by lAM. The C496 thiol side chains labeled by lAA were 
presumably exposed in initial conformation, while the C496 
thiols labeled by lAM were partially buried in the protrimer 
intermediate. This suggests that the lower region of the [3-helical 
domain in the protrimer was not locked in the native like con- 
formation, but maybe more flexible. As noted above, the peptides 
containing cysteines C613 and C635 were not recovered, reflect- 
ing the compact conformation in both the WT and AN N547Y 
(Fig. 8, and column 5 of Table 1). This result suggested that 
the sequences surrounding these two cysteines were buried or 
otherwise protected within the protrimer intermediate. 

The reactive C-terminal annulus thiols in the monomeric 
intermediate, C613 and C635 were labeled with lAA, but were 
protected in the protrimer. These results indicated the C-terminal 
domain of tailspike polypeptide chain undergoes significant 
conformational change prior to the assembly into the protrimer, 
consistent with this region having an auto-chaperone function as 
established for 029 appendage and KIF tailspike. 

Discussion 

The C-terminal region of tailspike of P22 is a very complex 
topological fold, with its interdigitated regions, triple [3-helix, 
cysteine annulus, and triangular bladed motif (Figs. 1 and 2). 
The P22 tailspike in vivo folding and assembly pathway has a ti/2 
of 5 min at 30°C, indicating complex kinetic processes. Inspec- 
tion of the C-terminal triple (3-helix, triangular [3-prism and 
cysteine annulus regions suggests that formation of these intim- 
ately intertwined domains may pose kinetic and/or thermodyn- 
amic barriers. Isolated (3-helical (residue 109-544) domains do 
not form stable trimers but are in reversible trimer-monomer 
equilibrium.^^ It seems likely that this wrapped feature of the 
C-terminal of the tailspike is responsible for the more than 40°C 



increase in thermostability in the transition from the protrimer to 
the native trimer.^"^'^^ 

The discovery that related adhesins have auto-chaperone 
domains (Fig. 1) provides a unifying model; these regions fold 
early and chaperone the chain assembly process to produce the 
intertwined domain with correct chain register. The presence of 
unusually reactive cysteine residues and perhaps transient disulfide 
bonds in the C-terminus presumably reflects some distinct local 
conformation that reflects the auto-chaperone function required 
for its in-register assembly and chain interdigitation. 

P22 tailspike C-terminal domain as auto-chaperone in 
protrimer folding intermediate. The native tailspike trimer is 
completely resistant to tyrpsin digestion. However, the C-terminal 
region of the partially folded monomeric intermediate was 
susceptible to trypsin digestion. In contrast the same regions 
were protected in the two protrimer intermediates tested. This 
suggests that the protrimer intermediates have trimer-like folded 
C-terminal domains, while the N-terminal domains are not fully 
assembled. This is consistent with an auto-chaperone function 
within the C-terminal sequences directing the chains into the 
protrimer configuration. 

In this auto-chaperone model the C-terminal interdigitated 
region forms early in protrimer formation. Our previous descrip- 
tion of this region serving as a molecular clamp, implied 
C-terminal association as a late step in protrimer formation.^^ 
From results reported here we conclude trimer assembly is 
initiated from the C-terminal domains, directing trimer assembly 
similar to the role to the C-terminal registration peptides in pro- 
collagen triple helix precursor folding (Fig. 9).^^'^^^ 

Diversity of auto-chaperone functions. The tri-bladed motif at 
C-terminal domain of P22 tailspike is similar in conformation to 
the other tailspike superfamily C-terminal auto-chaperone 
domains, though without sequence homology. ^^'^^ The P22 
auto-chaperone domain remains part of mature protein after 
trimer folding, while those of 029 and KIF are not. Why these 
tailspike are different in terms of proteolysis is unclear. 

The IMC/auto-chaperone functions have been reported in 
both C-terminal and N-terminal domains of many proteins, and 
for bacterial monomeric, or trimeric autotransporter proteins. ^^'^^ 
As noted earlier, for Bordetella pertussis monomeric autotran- 
sporter, the type I IMC/auto-chaperone domain at the end of 
N-terminal [3-helix helps N-terminal "passenger domain" folding 
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Figure 9. The folding and aggregation pathway of P22 tailspike adhesin. This schematic diagram depicts the C-terminal auto-chaperone directing 
productive folding and assembly. In the pathway each intermediates identified on 2-D gel are represented. The productive slower migrating monomeric 
(MsLow) intermediate has reactive cysteine [S]^• the non-productive faster migrating monomeric (Mfast) intermediate may have intra-disulfide bond, 
or be more collapse; the productive dimeric and protrimer intermediates contain inter-disulfide bond [-S-S-] formed from reactive cysteines in Mslow 
intermediates. The aggregation prone off-pathway species (*) do not contain productive intramolecular disulfide bonds. 
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upon export to the cell surface. ^^'^^ On the other hand, both 
Hemophilus influenzae Hia and Yersinia enterocolitia YadA 
adhesin trimeric auto-chaperone domains direct trimer assembly 
from the end of triple N-terminal [3-helix into head binding 
domain, as type II IMCs. These sequences remain associated and 
are not cleaved off ^^'^^ 

Many auto-chaperones and pro-peptides utilize transient 
disulfide bonds. The pro-peptide of Nerve growth factor (NGF) 
stabilizes folding intermediates to induce correct disulfide bond 
formation. The C-terminal pro-peptide of collagen, which 
initiates quaternary assembly of the collagen triple helix are 
stabilized by intermolecular disulfide bonds. ^^'^^ The pro-peptide 
of von Willebrand factor (VWF) mediates disulfide bridge for- 
mation to complete VWF dimer folding.^^ Most reported IMCs 
have been classified by mode of folding as type I or type II, but 
structural variations suggests the presence of other types of IMC 
classifications in protein folding intermediates. ^^'^^"''^ 

Are transient disulfide bonds physiological or artifactual? A 
variety of experiments indicate a role for transient disulfide bonds 
in protrimer folding and assembly.^^'^^ However, it has been 
difficult to ascertain if these are essential for efficient folding and 
assembly, or a byproduct of the slow folding reaction, and the 
proximity of the reactive 613 and 635 cysteines. The observation 
that double C613S/C635S are defective in folding can be 
interpreted as either supporting a role for transient disulfide 
bonds, or a role for the reactive thiol in maintaining some local 
critical non-disulfide bonded conformation. There are many 
examples of reactive cysteines in cytosols that transiently stabilize 
particular conformations. Li et al. has reported transient disulfide 
bonds-related subunits assemble in capsid protein of simian virus 



40 7^ Those transient disulfide bonds are well organized in their 
pentamer formation under the reducing environment of the 
cytosol. In those proteins folding intermediates, productive intra- 
disulfide bonds are transferred into inter-disulfide bonds. P22 
tailspike folding intermediates in the cytosol exhibit a similar 
mechanism between non-productive intra-disulfide bond contain- 
ing monomer and productive inter-disulfide bond containing 
protrimer. 

Regardless of the exact mechanism by which the C-terminal 
cysteine thiols direct the chain into the productive conformation, 
the interpretation that these steps are part of an auto-chaperone 
function as demonstrated in the KIF and 029 tailspikes, make 
it much easier to understand how these chains reach their 
intertwined highly stable native state. 

Materials and Methods 

Expression and purification of recombinant proteins. Wild type 
(WT), and N-terminal truncated (AN) N547Y of P22 tailspike 
were recombinantly expressed and purified from E. coli as 
previously described.^^'^^ The purity of proteins was checked by 
SDS-PAGE. All protein concentrations were calculated by using 
A280 nm = 0.98.^^ 

Urea unfolding and refolding. P22 tailspike at 2.0 mg/mL was 
unfolded in acidic urea buffer (5 M urea, 50 mM sodium citrate 
and 2 mM EDTA, pH 3.0) for 1 h at 24°C. Unfolded tailspike 
was diluted to 0.1 mg/mL into refolding buffer (0.58 M urea, 
100 mM Tris and 2 mM EDTA, pH 7.6) and incubated for 
15 min on wet ice to accumulate productive folding inter- 
mediates.'^^ Refolding was performed as specified. 
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Alkylation labeling for refolding P22 tailspike. Addition of 
0.1 M lAA stock solution to refolding P22 tailspike at each 
indicated time and further incubation was performed in the dark 
for a total of 1 h at 24°C. To prepare samples for non-reducing 
2-D native-PAGE, lAA stock solution was added after 2 h 
refolding tailspike at 4°C, then incubated further for 1 h in dark. 
The final concentration of lAA was 30 mM for all experiments. 
At the end of labeling, all samples were mixed with ice-cold 3 X 
native-PAGE sample buffer (14 mM Tris-base, 109 mM glycine, 
30% (v/v) glycerol, 0.1% bromophenol blue), and kept in wet ice 
until loaded into native-PAGE. 

Double alkylation for mass spectrometry. To obtain alkylated 
peptides of monomer intermediates, lAA stock solution was 
added to 30 min refolding tailspike reaction and incubated for 
additional 2 h at 4°C. The final concentration was 30 mM lAA. 
After non-reducing 1-D native-PAGE, coomassie blue stained 
monomeric band was excised, reduced and alkylated with lAM 
followed by in-gel digestion methods with trypsin, Asn-N and 
Glu-C. 

For protrimer alkylation, 2 h refolding WT, and AN N547Y 
were loaded into non-reducing 1-D native-PAGE. After electro- 
phoresis, excised protrimer bands were immediately incubated 
in native-PAGE running buffer with 30 mM lAA for 1 h at 4°C 
in the dark, then applied for in-gel digestion and mass spectro- 
metry analysis together with monomeric bands. Tryptic peptides 
were identified with QSTAR Elite mass spectrometer (Applied 
Biosystems) installed at the Koch Institute Proteomics lab (http:// 
web.mit.edu/ki/facilities/core.html#biopolymers). 



Reducing/non-reducing 2-D PAGE. To investigate heat 
stability and thiol group reactivity, refolded, and alkylated P22 
tailspike was electrophoresed through non-reducing 1-D native- 
PAGE. Refolding and alkylation were performed as specified. 
After 1-D native-PAGE, the excised lane was incubated in native- 
PAGE running buffer in the presence, or absence of 0.1 M DTT 
for indicated time and temperature. After incubation, gel was 
loaded onto the 2-D native-gel. 

Gel electrophoresis. All electrophoresis were performed as 
previously described.^^ The concentrations of each acrylamide/ 
bisacrylamide in the resolving gels were 9% for native-PAGE. The 
concentrations of stacking gels were 3.6% for native-PAGE. All 
reducing and non-reducing 2-D PAGE were performed with the 
Bio-Rad Criterion cassette system (Bio-Rad). Native-PAGE was 
performed at 150 V with a constant current of 15 mA/gel for 4 h 
in a 4°C cold room. All proteins separated by PAGE were 
visualized usins Krypton''^' (PIERCE), and imaged on a 
Typhoon™ Variable Mode Imager (GE healthcare). 
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